Chapter 2 

Phonetic Description 

Cantonese presents an interesting case of relatively simply^ 
phonetics -- especially with respect to segments. We shall not attempt 
a description of fine phonetics but rather a broad outline of phonetic 
facts sufficient for presenting the phonological analysis. For 
example, we shall not emphasize how the individual initial consonantal 
sounds vary in pronunciation when they are followed by different types 
of vocalic sounds, 1 or how, depending on the preceding or following 
tone, the tonal contour of the same tone is modified, 2 unless the 
modification is significant and consistent enough to produce a case of 
tone sandhi. Minute phonetic differences are in many cases universally 
predictable and thus need not be included in our description of 
Cantonese. Moreover, what is described here pertains to sounds heard 
or understood, either in isolation or in sequence, under highly 
idealized conditions affected by no distractions of any kind, such as 
might be expected of utterance in normal discourse . 3 

2.1 Syllable structure 

Traditional Chinese phonologists always described sounds of a 
Chinese syllable in terms of an initial-final dichotomy. This is in 
fact quite reasonable, since most of the morphemes in Chinese are 
monosyllabic and moreover, the structure of the syllable is relatively 
simple. By "initial" is meant the initial consonant of a syllable, 
and by "final" the rest of the syllable constituents including tone. 
However, for convenience of discussion here, we single out the tone from 
the final and discuss it in a separate section. The initial may be 
optional, but its absence is often described more positively as "the 
zero initial". 4 The final consists of an optional medial (nonsyllabic 
vowel), a syllabic segment and an optional ending. In Cantonese, the 
initial, if present, is either a consonant or a glide; 5 the medial, if 
present, is always a glide; 6 the syllabic segment is usually a vowel, 
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although it may also be a nasal consonant if it is the only constituent 
in the syllable; while the ending, if present, is either a glide, a 
nasal or a stop consonant. Excluding tone, the Cantonese syllable can 
be tentatively described by the following formula: 7 

fc, 
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) V ( 



) 



where C stands for consonant, G for glide and V for vowel. The 
subscript 1 refers to prevocalic segments and the subscript to 
postvocalic segments. The formula, however, excludes the case where the 
syllable consists solely of a syllabic nasal. Segments thus combine to 
form the following syllable types according to the above formula: 

V C 1 V C 1 VC 2 
G 1 VS C 1 VG 2 

vc 2 Gl vc 2 



VG 2 G 1 VG 2 



2.2 Initials 



As can be deduced from our formula, the initial segment of a 
Cantonese syllable can be either a consonant or a glide. The Cantonese 
initials are tabulated in Table 6: 

Table 6 Initials 
Labials p p » m f 

Dentals t t' n 1 

Alveolars/ ts/t/ ts'/t/' s// 

Palato-alveolars 

Velars k k' q h 

Labialized Velars k k f 
w w 

Glides (or semi- u i f (?) 

vowels) 

The stops and affricates are dichotomized into a voiceless 
unaspirated series and a voiceless aspirated series. The closure of 
the stops and nasals is strong. The sibilants are usually pronounced 
with dental articulation in the author's idiolect; other speakers 
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vary from a dental (hissing sound) to a palatal articulation (hushing 
sound) . Often these sibilants are pronounced with some degree of 
palatalization when they precede high front vowels. 8 However, the 
existence of the distinction dental -versus -palatal with respect to 
these sibilants in the standard dialect is more artificial than real. 9 
The labialized velars, strictly speaking, may be heard as two closely 
knit segments, but since syllables beginning with them are different 
from for example, Mandarin syllables with velar initials followed by 
an [ft] glide, the Cantonese labialized velars are generally taken as 
single labialized initials, although the phonological structure of 
Cantonese may not agree with this analysis. 10 A glottal stop may 
appear at the beginning of a syllable without any initial consonant 
and may substitute for the smooth vocalic ingress. It is usually 
omitted from the list of initials. With respect to the glide initials, 
[u] ranges from bilabial to something between bilabial and labiodental 
and [y] occurs only before front rounded vowels. As already noted by 
Jones and Woo 1912, these glides may be pronounced with considerable 
friction when they are followed by high vowels. 

Due to the influence of neighboring 9 dialects, there is a merger 
of certain initials in the speech of a number of Cantonese speakers. 
For example, [n] is merged with [l] 11 by some speakers and [q] is 
sometimes confused with the zero initial. 12 However, speakers of 
standard Cantonese (the majority) maintain the distinctions between 
these sounds consistently. 

2.3 Finals 



Table 7 gives an exhaustive list of all the finals (53 in all) in 
Cantonese. 
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Table 7 Finals 

a: A:i a:E Aim A:n a:q A:p A:.t A:k 
vi bu Bm Bn bq ^p vt Bk 

e: eiQ e:k 

y 
ei 

IQ ik 

i: i:u i:m i:n i:p i:t 

oe: ce:q oe:k 

0y 5z5n 0t 

o: 0:1 o:n o:q o:t o:k 

on 

UQ uk 

u: u:i u:n u:t 



y: y:n y:t 

? 



m 
1 



As already noted in section 2.1, the Cantonese final consists 
of a vowel, or a vowel and an ending, or simply a syllabic nasal. The 
first column of Table 7 shows that seven different kinds of vowels 
can occur alone as finals, and they are all long. On the other hand, 
vowels which combine with an ending to form a final are either long 
or short, although all the short vowels are regularly followed by an 
ending segment. 

One of the most prominent features in Cantonese phonetics is the 
"long-short" distinction of vowels, 13 which is closely related to the 
distribution of certain tones, as we shall see later. When the vowel 
of a syllable is long, the ending is comparatively short, and when the 
vowel is short, the ending is comparatively long, except when the 
ending is a stop consonant, in which case the vowel is always shorter 
than elsewhere. In general, there is a kind of complementary inter- 
play between the vowel and the ending. Thus, there are in fact three 
gradations of relative phonetic length among the vowels in syllables not 
with the Ru-tone: 14 longer when the ending is zero, long when the ending 
is short, and short when the ending is long. 
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In Jones and Woo 1912, another variety of the high front vowel 
is recorded: [i], which is preceded by sibilant initials in certain 
syllables. As pointed out by Chao, this pronunciation is due to 
influence from the Shun-de dialect. The interesting point is that 
most syllables transcribed with this [i] seem to have had mostly 
Division II or IV initials in Ancient Chinese, while those tran- 
scribed with [i] in Jones and Woo have had mostly Division III 
initials. 15 

The vowel [i] is sometimes identified as [e] , and [u] as [o] , 
as for example in Chao 1947. [a:] is sometimes transcribed as [a:] 
or [a:] . 

The possible endings of the final are the three glides [1] , 
[y] and [u] , the three nasals [m] , [n] and [q], and the three stops 
[PL [t], [k] . The stop endings are unreleased and voiceless. 
They are sometimes described as glottalized 16 or implosive. 17 

The combination of vowels and endings is rather lopsided. 
Only [a:] and [b] can combine with any of the endings except [y] , 
which occurs only with |>] . Apart from [a:] and [>] , the [i] glide 
combines with [e] , [o:] or [u:]; while the [t(] glide combines 
with [i:] or [o] . It is evident from Table 7 that 'finals with 
nasal endings are exactly matched in number by finals with correspond- 
ing stop endings. Again putting aside [a:] and [>] , the labial 
endings combine only with [i:]; the dental endings with [i:], [o:], 
W> [u:], [y:]; and the velar endings with [e:], [i], |>:], [oe:] 
and [u] . 

2.4 Tones 

Cantonese is known to have a rich tonal system. This is true 
as far as quantity is concerned. Traditionally, nine or ten tones 
are described for Cantonese. In Table 8 we shall list the tones with 
traditional terminology and their pitch contours in accordance with 
Chao's system. 18 
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Table 8 Tones 



Yin 
(Upper) 



Ping 
(Even) 



53 55 



Yang 

(Lower) 21 22 



Shang 
(Rising) 



35 



24 



Qu Ru 

(Going) (Entering) 
High, Middle 



44 5 

~ or 
33 3 



It is immediately apparent that, as far as pitch contours are 
concerned, the actual value of the Ru-tones which traditionally refer 
to syllables with stop endings may be identified with other tones: 
high Yin-Ru can be identified with the nonf ailing variant of Yin-Ping, 
middle Yin-Ru with the Yin-Qu, and Yang-Ru with Yang-Qu. 19 As far as 
length is concerned, the Ru-tones are shorter. Thus, the Cantonese 
tones can be described as forming two pitch groups: contour and register^ 
the former comprising both falling and rising tones. All have both 
higher and lower pitch subgroups. 

The Ping-tones are either falling or level, the Shang-tones are 
rising: while both the Qu- and the Ru- tones are level. Apart from 
the phenomenon described in section 2.5, the two values here given for 
the Ping-tones are in many cases predictable or interchangeable. 20 
For example, some speakers prefer the level variant of the Yang-Ping 
tone, probably because of the extreme low register of the low falling 
contour (although of course pitch register is a relative matter; 
despite the natural assumption that with its wealth of tone variation 
Cantonese will display a wide band of pitch, the range within a major 
5th is preferred to that of a major 6th in the normal speech of some 
Cantonese speakers) . 
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2.8.1 Tone sandhi 

Although Cantonese is rich in the number of tones, it is rather 
poor in tone sandhi. Except for unstressed syllables, almost every 
syllable is pronounced with the same tone in isolation that it bears 
in sequence. In normal discourse however, tone sandhi occurs with the 
Ping-tones or falling tones when followed by certain tones. A falling 
tone loses its falling contour and becomes a level tone if it is 
followed by another falling tone of the same pitch register. Thus, 
a high falling tone becomes a high level tone if followed by another 
high falling tone; and a low falling tone becomes a low level tone if 
followed by another low falling tone. In addition, the products of 
this tone sandhi have the same effect on the preceding falling tones. 
Namely, a high falling tone becomes a high level tone if followed by a 
high level tone which is the product of a tone sandhi; and a low 
falling tone becomes a low level tone if followed by a low level tone 
which is the product of a tone sandhi. These are the conditions for 
tone sandhi common to the falling tones. The high falling or Yin- 
Ping tone is subjected to tone sandhi under two other conditions: 
it becomes a high level tone if followed by the high Yin-Yu tone 
or by the high level changed tone too. However, it is quite apparent 
that the general condition for tone sandhi is the pitch level at which 
a falling tone begins. We can summarize the condition as follows: 
a falling tone becomes a level tone if followed by another tone that 
begins at the same level, whether the latter is level or falling. 
Thus: 



53 


— * 55 / 53/55/5 




21 


— * 22 / 21/22 




For example: 






%ii 


iiQ 53 ko:i 53 — ►■ iiq 55 ~ 


'should, must' 


'-& 


SBm 53 kep 5 — »- SBm 55 ~ 


1 anxious ' 


#& 


mA: 21 iBU 21 — >-mA: 22 ~ 


'sesame oil' 



In fact, sequences of more than two syllables undergo tone sandhi under 
the same condition: 45 
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% tf *ft 


»H1? 


££* 


*££/*] 


11^* 



tsy:n 53 SBU 53 pA:n* 55 — >- tsy:n 55 si2i! 55 ~ 'special class 1 
si: 21 si: 21 lBi 21 — * si: 22 si: 22 - f often come' 

tsy: 53 si:n 53 sA:rj 53 — >- tsy: 55 si:n 55 ~ f Mr. Zhu' 46 
ts'i:n 21 yy:n 21 t'uo 21 — >- ts'i:n 22 yy :n 22 ~ a proper name 
yy:n 21 yy:n 21 ts'y:n 21 ts'y:n 21 — >- yy:n 22 yy:n 22 ts'y:n 22 ~ 

'completely' 
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